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Abstract 

We exhibit a polynomial reduction from the problem of finding a Nash equilibrium of a bimatrix 
game with rational coefficients to the problem of finding a Nash equilibrium of a bimatrix game with 0-1 
coefficients. 



1 Introduction 

We study the problem of finding Nash equilibria in two-player matrix games, a problem that has found 
increasing applications in both economics and the internet. It has been shown that it is #P-hard to count 
all Nash equilibria of a two-player game, even if all the matrix entries are or 1 [CS1]. However, the 
complexity of finding a Nash equilibrium is wide open, and has been proposed as one of the most important 
open problems in complexity theory today [Papl]. 

We give a polynomial reduction from finding Nash equilibria in general bimatrix games to finding Nash 
equilibria in games where all payoffs are either or 1, resolving an open problem posed in [CS2]. 

2 Definitions and General Lemmas 

Definition 1 (Game) A bimatrix game is a two-player game defined by a pair (R,C) where R and C are 
m x n matrices. R and C are the payoff matrices for the row and column players, respectively. When the 
game is played, the row player picks a row i to play and the column player picks a column j to play, and 
each player gets a payoff equal to the element (i,j) of his payoff matrix. 

The goal of the game is to maximize one 's expected payoff. 

Definition 2 A pure strategy for the row or column player is a row or column index of the payoff matrix, 
respectively. A (mixed) strategy is a probability distribution over the pure strategies, denoted by a vector 
x. To be a probability distribution, each entry must be in [0,1] and their sum must be 1. The support of a 
strategy Supp (x) is the subset of the pure strategies which the player sometimes plays. 

Note that the (expected) payoff for a player with payoff matrix M if the row player is playing strategy x 
and the column player is playing strategy y is x 1 My. 

Given such a game, a natural question that arises is what a "rational" player should do. The notion of 
rationality that has become widely accepted and almost ubiquitous is that of mutual best responses. The 
concept is that a player should play a strategy that maximizes his payoff, given what the other player is 
playing. 
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Definition 3 A strategy x is a best response (for the row player) to a strategy y if for every strategy x' , 
x' T Ry < x T Ry. 

Given such a game, one might ask what the outcome might reasonably be when both the row and column 
players are "rational" . In this situation, both players will be playing the best response to the other's strategy. 
This is exactly the notion that the pair of row and column strategies are in a Nash equilibrium. 

Definition 4 A Nash equilibrium is a pair of (mixed) strategies x* and y* such that each player's strategy 
is a best response to the other's, i.e. for any strategies x,y, 

x T Ry* < x* T Ry*, and x* T Cy < x* T Cy* 

Definition 5 A zero-sum game is a game in which R + C = 0, where by we mean the m x n zero matrix. 
More generally, a constant-sum game is one where R + C is an m x n matrix all of whose entries are the 
same. We also will consider the notions of a {0, 1} -game, where all payoffs are either zero or one. 

Note that constant-sum games are equivalent to zero-sum games, since subtracting the constant from all of 
the entries of one player's payoff matrix M gives a zero-sum game. We apply the following lemma: 

Lemma 6 The best responses for a player with payoff matrix M are the same as those if we replace the 
matrix by z(M — tr), where 1 is the m x n matrix with all entries equal to one, for any r, z € IK, z > 0. 

Proof: We have that 

x T z(M - rl)2/ = z(x T My - rx T ty) = z{x T My - r) 

so that the payoffs are simply shifted by a constant and then scaled by another constant, which by linearity 
does not affect the notion of a best response. I 

Lemma 1 x is a best response to y if and only if for every pure strategy e,, 

e i Ry < x Ry 

Proof: Suppose every pure strategy e, satisfied 

e i Ry < x Ry- 

Then by linearity of matrix multiplication, for an arbitrary x' = J2t=i x 'i e i, we have 

(m \ m m / m \ 

^2 x'fii J Ry = ^ x • (ei T Ry) < ^ x \ {x T Ry) = I ^ x'A (x T Ry) = x T Ry 

which implies that a; is a best response. The converse is obvious, since any pure strategy is a strategy. I 

The following corollary simplifies the process of testing whether a pair of strategies is a Nash equilibrium. 

Corollary 8 (x* , y*) is a Nash equilibrium if and only if neither player has a pure strategy that gives that 
player a higher payoff. 

In this paper, we consider the following problem: 

Given a bimatrix game G, find a Nash equilibrium 

which defines the complexity class NASH . We assume here that the elements of the matrices that define the 
game G are represented as rational numbers in the canonical way. A related complexity class is the class 
NASH{ 0]1 }, which is the subset of the class NASH where the matrices are restricted to have values in {0, 1}. 

The main result of this note is that there is a polynomial reduction from NASH to NASH{ 0]1 }. 



3 Reduction to Mimicking Games 

The first step of our reduction is the well-known reduction from general games (R, C) to mimicking games 
(M, I), where I is an identity matrix. These games are called "mimicking" because the payoff of the second 
player is non-zero iff she plays the same move as the first player. This implies the following simple lemma. 

Lemma 9 In any Nash equilibrium (x*,y*) of the mimicking game (M,I), Supp (y*) C Supp (x*). 

Proof: Suppose for the sake of contradiction that the second player sometimes plays a strategy not in the 
support of a;*. The second player will get payoff in this case, and could profitably change her play to mimic 
some strategy in x* . Thus (x* ,y*) is not a Nash equilibrium, the desired contradiction. I 

The next lemma trivially implies the reduction to mimicking games. 

Lemma 10 [CS2] For any bimatrix game (R, C) with R and C to x n matrices, there is a mimicking game 

( C T \ 
(M, I) with M = I _ _ I and I the (to + n) x (to + n) identity matrix such that the Nash equilibria of 

(R,C) correspond exactly to the Nash equilibrium strategies of the column player in the game (M,I). 

Proof: (sketch, see [CS2] for details) Given an equilibrium (x*,y*) of the game (M,I), we decompose y* 
as y* = (c*,r*) where r* and c* have to and n entries respectively. We then note that the condition that 
(x*,y*) is an equilibrium of (M, I) trivially implies the conditions that that make (r*,c*) an equilibrium of 
(R,C), after we scale r* and c* to have sum 1. 

Similarly, if (r*,c*) is an equilibrium of (R,C), we may scale the two vectors to vectors (ox*, /3c*) with 
a, (3 > so that the scaled incentives are equal: 

amax(Cr* ), = /?max(flc* ),, 

i i 

and so that (ar* , /3c*) sums to 1. Let y* = (ar* , /3c*) and let x* be uniform on the support of y* . It is 
straightforward to show that (x* ,y*) is a Nash equilibrium of the game (M, /). I 

The above lemma implies that NASH is equivalent the the mimicking-game version of NASH . 

4 Reduction to {0, 1} Games 

We now describe the reduction to {0, 1} games. The first ingredient of our construction is a method of 
representing each rational entry of a payoff matrix using only zeros and ones. 

We first note that given an n x n game (M,I), by Lemma 6 we can scale M by any positive constant, or 
add any number to its entries without changing the Nash equilibrium strategies. Thus we may put all the 
rational entries in M under a common denominator to produce a new matrix M' whose entries are integers. 
We further note that while each entry may now take more bits to express, the number of new bits that are 
needed per entry is at most the number of bits in the common denominator, which is at most the number 
of bits needed to express M. Thus the total number of bits has increased at most quadratically. 

We now have a game (M',I) where each entry is integral. Our strategy from here is as follows: we will 
express each entry in binary by replacing the n x n matrix M' with an kn x kn block matrix M" , for some 
k greater than the binary length of any entry in M' . We then encode each entry of M' into a binary string, 
and place it in the corresponding k x k block of M" , using the rest of the entries to ensure that the binary 
string is correctly "interpreted" as representing an integer payoff. 



We now describe the part of the construction that enforces this interpretation: a (k — 1) x (k — 1) matrix 
G with the property that the game (G,I), for a (k — 1) x (k — 1) identity matrix 7, has a unique Nash 
equilibrium (r*,c*), and furthermore, there exist ^p columns whose probabilities of being played are in the 
ratio 
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This property is proven in the following lemma. 
Lemma 11 Define matrices A,B as 
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define the pp x K^- matrix G 1 to have the following j x j block form: 
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Explicitly, G' has block B on the minor diagonal, block A above, and below. Further, let G 



1-G' 



G' 

Then the game (G,I) (equivalent to the constant-sum game (G',1 — G 1 ) has a unique equilibrium (r*,c*) 
with 

r' = £(l,W) 



,2-2-> 



-v, ■ 



2-i 



where v 



V 3-2J -3 3-2J : -3 
i (2J- 1 ,2-»- 1 ,2-»- 1 ,2-»- 2 ,2-»- 2 ,2-»- 2 ,...,2,2,2,l,l,l) 
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Proof: Consider the game (G 1 , 1 — G 1 ). This will have a Nash equilibrium (x,y) with full support only if 
all entries of G'y T and all entries of x(l — G 1 ) are equal. 

We show by induction that these constraints are satisfied iff both x and y are scaled versions of v. Suppose 
we know the first 2>i entries of y are in the proportions of v. Then in the i + 1st block row of G' , we 
have three rows, which must all have equal incentives for the row player since the row player's strategy has 
full support, by hypothesis. For each of those rows, the incentives from the A-blocks are the same by the 
inductive hypotheses, so we must have that the payoffs from the B-block are also the same, i.e. 

2/3J+1 + V3i+2 = 2/3J+1 + V3i+3 = V3i+2 + 2/3J+3, 

which implies y3i+i = y3t+2 = 2/3J+3- To show the ratio of 2 : 1 between adjacent blocks, we note that the 
payoff for this block of rows must be the same as that for the previous one; the payoffs of the two blocks 
differ by 2y 3i — (y 3i + 2y 3i+3 ); setting that to zero proves the inductive step. Thus we have shown that the 
entries of y are in the ratio specified by v. 



Clearly, the same argument applies to x. Thus since both x and y must sum to 1, i 
Nash equilibrium with full support. 



y = v is the only 



We now note that the game (G',1 — G') is in fact a constant-sum game, so its Nash equilibria are the 
solutions to a linear program. This implies that the set of Nash equilibria is convex. If we suppose for the 
sake of contradiction that there is another Nash equilibrium in addition to the one x = y = v, then all linear 



combinations of these two equilibria must also be equilibria, and hence by standard topology arguments 
there are a continuum of full support equilibria, which contradicts the uniqueness argument of the previous 
paragraph. 

Thus x = y = v is the unique equilibrium for the game (G 1 , 1 — G'), and by lemma 10 the game (G,I) has 
the unique equilibrium described above. I 

Thus we can express integers t in our game by representing them in binary as {tj-\...t\to), and putting digit 
ti in column 3(j — i) of a matrix based on G. 

We now show how to embed the matrix G in larger games so as to allow the binary representation described 
at the beginning of this section. 

Given a game (M', I), with M' an n x n matrix with integral entries, construct the kn x kn {0, 1} -matrix 
M" as follows. Construct the k — 1 x k — 1 matrix G defined in the above lemma, and append a column of 
k — 1 ones to the right end to create afc-lxl: matrix G. Place this matrix along the main diagonal of M" , 
(with the upper-left corner on the diagonal) filling the rest of these (k — l)n rows with zeros. Note that this 
leaves n rows unaccounted for. Since M" may be considered as a n x n block matrix, we fill in each block's k 
unspecified entries with the binary expression of the corresponding entry in the nxn matrix M' , putting the 
ith digit of entry M' rs in M' kr k(s _ 1 - ]+3(j _ i - ] as described above. We make the slight modification of making 
first two entries in these rows one, M' kr k(s _ 1 - ]+1 = ^kr k(s-i)+2 = -*■' so tnat the num ber represented is 
actually M' r s + 2 J . The rest of the entries are 0. 

We prove the following lemma, which implies our main result. 

Lemma 12 Given any game (M,I) with M rational, construct the game (M",I') by first rescaling M to 
an integral matrix M' , and then shifting M' so as to make its entries integers in the range < M' r s < 2 J for 
some j. Let k = 6j + 1, and construct the matrix M" as above. Then, up to scaling, the Nash equilibrium 
strategies for the column player of the game (M,I) are identical to the elements (l,fc + 1, ..., (n — l)k + 1) 
of the Nash equilibrium strategies for the column player of the {0, 1} -game (M",I'). 

We note that since the size of M" is polynomial in the number of bits used to express M, and lemma 10 proves 

( C T \ 
that a mimicking game M = I I may be constructed so that the Nash equilibrium strategies for 

its column player correspond exactly to the Nash equilibrium strategies for both players in the game (R, C), 
this lemma trivially implies the following theorem. 

Theorem 13 NASH = NASH {0>1} . 

We now prove the lemma. 

Proof: Consider any Nash equilibrium (x 1 ,y') of the game (M" ,/'). Motivated by the block decomposition 
of M", we consider y' in blocks of k. Recall from the construction of M" that each occurrence of G stands 
alone in its corresponding rows, and that the corresponding entries in I' form a^-lxfc-1 identity matrix 
with a column of zeros added. A straightforward application of the definition of a Nash equilibrium reveals 
that the k — 1 corresponding weights in y' are either all zero, or are a scaled Nash equilibrium for the game 
(G,I). Thus these k - 1 weights {y[ r _ 1)k+i }i=i equal y'^(2^- 1 ,2^>- 1 ,2^- 1 , ..., 1, 1, 1, ...) for some y'J > as 
shown in lemma 11. Thus we have found a block representation for any Nash equilibrium strategy y' . Note, 
however, that we have not yet discussed the n entries y', r _ 1 \ k+k = y' rk - 

Recall from lemma 9 that y', r _ 1 \ k+i > implies that x l ,_ 1 - ]k+i > 0, which implies from the definition of a 

Nash equilibrium that the ((r — l)k + i)th entry of M"y' is at least as big as any other entry. We apply 
this technique to prove a sequence of useful results. 



Suppose for the sake of contradiction that for every r, y" = 0. This means that the only nonzero entries 
are those of the form y' rk . Note, however, that this implies that the (rfc)th entries of M"y lT are all 0, since 
for any r, s, M" k sk = 0. However, the (rfc)-th columns have ones everywhere else, so every other row gets 
positive payoff. Thus this is not a Nash equilibrium. We conclude that some y" > 0. 

For some y" > 0, consider the payoffs of the k rows (r — l)k + 1, ..., rk. From the construction of the matrix 
G, we conclude that the first k — 1 of these payoffs equal 2 J y" + r' rk and that the last payoff is at least the 
sum of the entries in the k corresponding columns: (M' rr + 2 : >)y' 1 ! , where by construction, M' rr > 0. Thus 
this may be a Nash equilibrium only if r' rk > 0. Thus whenever y" > 0, we must have y' rk > 0. The crucial 
consequence of this is that the payoff of the (rfc)th row must now be optimal by the mimicking argument. 

Note that the incentive of the (rfc)th row is just 

Y,(K' + *)v"> (!) 

s 

which equals the incentive in the game (M 1 + 2 J , I) when the second player plays strategy y" (up to scaling) . 
Since as noted above, for every nonzero y", the corresponding row must have optimal incentive, we conclude 
that the strategy y", properly scaled, is in fact a Nash equilibrium of the game (M, I). We have proven one 
direction of the correspondence. 

The other direction is fairly straightforward. Given a Nash equilibrium (x, y) of the game (M, I), let y" = ay 
for some scaling constant a, and let 

{ylr-Dk+i^^Vr^- 1 ,^- 1 ,^- 1 ,..., 1,1,1,...), 

as above. From equation 1 we see that all the optimal incentives in (M, I) will remain optimal in (M",I') 
when we restrict our attention to rows rk. Further, each k — 1 block of rows will have equal payoffs since 
their corresponding columns have probabilities proportional to the full-support Nash equilibrium of the game 
(G,I). To make all these blocks have equal payoffs, we need only pick the additive constants y' rk so that 
the total payoffs are equal. We then scale these y' rk and a so as to make ^ y\ = 1, and we have a Nash 
equilibrium, as desired. 

Thus we have constructed the desired correspondence between the column player strategies in Nash equilibria 
of the games (M,I) and {M" ,!'). I 
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